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A NOTE ON CONVEX CHARACTERS, EIBONACCI NUMBERS AND 
EXPONENTIAL-TIME ALGORITHMS 

STEVEN KELK AND GEORGIOS STAMOULIS 


Abstract. Phylogenetic trees are used to model evolution: leaves are labelled to represent 
contemporary species (“taxa”) and interior vertices represent extinct ancestors. Informally, 
convex characters are measurements on the contemporary species in which the subset of species 
(both contemporary and extinct) that share a given state, form a connected subtree. Given 
an unrooted, binary phylogenetic tree 7” on a set of n > 2 taxa, a closed (but fairly opaque) 
expression for the number of convex characters on 7” has been known since 1992, and this is 
independent of the exact topology of T. In this note we prove that this number is actually equal 
to the (2n—l)th Fibonacci number. Next, we define Qki'T) to be the number of convex characters 
on T in which each state appears on at least k taxa. We show that, somewhat curiously, g 2 {'T) 
is also independent of the topology of 'T, and is equal to to the {n — l)th Fibonacci number. As 
we demonstrate, this topological neutrality subsequently breaks down for k > 3. However, we 
show that for each fixed k > Qki'T) can be computed in 0{n) time and the set of characters 
thus counted can be efficiently listed and sampled. We use these insights to give a simple but 
effective exact algorithm for the NP-hard maximum parsimony distance problem that runs in 
time ©(0^ • n^), where 0 1.618... is the golden ratio, and an exact algorithm which computes 

the tree bisection and reconnection distance (equivalently, a maximum agreement forest) in time 
©(0^^ ■ poly(n)), where (jp ^ 2.619. 


1. Introduction 

Phylogenetics is the science of accurately and efficiently inferring evolutionary trees given only 
information about contemporary species HU. An important concept within phylogenetics is con¬ 
vexity. Essentially this captures the situation when, within a phylogenetic (i.e. evolutionary) tree, 
each biological state emerges exactly once: it should not emerge, die out, and then re-emerge. 
More concretely, given a phylogenetic tree and a set of states assigned to its leaves, can we assign 
states to the internal vertices of the tree such that each state forms a connected “island” within 
the tree? If this is possible, the assignment of states to the leaves is known as a convex charaeter. 

In this article we present a number of results concerning the enumeration of convex characters. 
In Section [U we give formal definitions and describe relevant earlier work. In Section [3] we start 
by showing that an earlier result counting convex characters can be simplified to a term of the 
Fibonacci sequence. We then seek to count convex characters with the added restriction that each 
state should occur on at least k leaves, proving the somewhat surprising result that (as for fc = I) 
tree topology is irrelevant for fc = 2, and that a formulation in terms of Fibonacci numbers is again 
possible. We give an explicit example showing that for fc > 3 the topological neutrality breaks 
down. In Section U we show that for all fc the size of the space can be counted in polynomial 
time and space using dynamic programming, which also permits listing and sampling uniformly 
at random, noting also that non-isomorphic trees can have exactly the same vector of space sizes 
(for fc = 1, 2,...). In Section [S] we give a number of algorithmic applications for NP-hard problems 
arising in phylogenetics that seek to quantify the dissimilarity of two phylogenetic trees. Finally, 
in Section |6] we briefy discuss a number of open problems arising from this work. The software 
associated with this article has been made publicly available. 

2. Preliminaries 

For general background on mathematical phylogenetics we refer to mM- An unrooted binary 
phylogenetic X-tree is an undirected tree T = (V(T),E(T)) where every internal vertex has 
degree 3 and whose leaves are bijectively labelled by a set A, where X is often called the set of 
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taxa (representing the contemporary species). We use n to denote |X| and often simply write tree 
when this is clear from the context. 

A character / on A is a surjective function f : X ^ C for some set C of states (where a state 
represents some characteristic of the species e.g. number of legs). We say that / is an r-state 
character if |C| = r. Each character naturally induces a partition of X and here we regard two 
characters as being equivalent if they both induce the same partition of X. An extension of a 
character / to E(T) is a function h : E(T) —!> C such that h{x) = f{x) for all x in X. For such an 
extension h of /, we denote by lh{T) the number of edges e = {m,u} such that h{u) ^ h{v). The 
parsimony score of a character / on 7 ”, denoted by lf{T), is obtained by minimizing IhiT) over all 
possible extensions h of f. We say that a character / : A —>■ C is convex on T if Ifi'T) = |C| — 1. 
Equivalently: a character / : A —> C is convex on 'T if there exists an extension h oi f such that, 
for each state c G C, the vertices of T that are allocated state c (by h) form a connected subtree 
of T. We call such an extension h a convex extension of /. See Figure [T] for an example. The 
convexity of a character can be tested in polynomial mil] ( in fact, linear m) time. 




Figure 1 . For the given tree T (on 7 taxa) there are 233 convex characters 
in total, but only 8 in which each state appears on at least 2 taxa, and these 
are shown above. 1 character uses exactly 1 state {abcdefg), so g 2 {T, 1) = 1, 4 
characters use 2 states {ab\cdefg, abc\defg, abcd\efg and abcde\fg), so g 2 {T, 2 ) = 

4 and 3 characters use 3 states (ab\cd\efg, abc\de\fg and ab\cde\fg), so g 2 iX, 3) = 

3. For each character we have shown an extension verifying that the subtree 
induced by each state is connected i.e. that the character is convex. 

We write g{T,r) to denote the number of r-state convex characters on T and gi{T,r) {i > 1) 
to denote the number of those characters that have the additional property that each state used 
by the character appears on at least i taxa. It follows from the definition of character that 
(/i(T, r) = g(T,r). We define: 

n 

S*(T) = '^gi{r,r). 

r—1 

The value gi{T) is therefore equal to the total number of convex characters on T. For the tree 
shown in Figure (TJ g{T) = gi{T) = 233 and 32 (T) = 8 . We adopt the standard convention that 
the binomial coefficient (^) evaluates to 1 if fc = 0, and 0 if n < fc and fc > 0. In [T3] it is proven 
that, for n, r > I, 

g{T,r) = gi{r,r) = ^ 


A NOTE ON CONVEX CHARACTERS, FIBONACCI NUMBERS AND EXPONENTIAL-TIME ALGORITHMS 3 


Hence, 


( 1 ) 


51 (T) = 


r—1 


2n — r — 
r—1 



As observed in m, the expression for gi (T, r) (somewhat surprisingly) does not depend on the 
topology of T, only on the number of taxa n. Hence we can write g{n), gi(n) and gi{n, r) without 
ambiguity. 


3. Fibonacci numbers and convex characters 


Theorem 1. The value 32 ( 7 ”, r) only depends on n (i.e. the topology of T is not relevant) and 
for n > 2 , r > 1 is given by the expression 


52(T,r) 



Proof. We prove this by induction on n. For the base case note that for n S {2, 3,4} there is only 
one binary tree topology (up to relabelling of taxa) possible on n taxa and that the expression 
correctly evaluates to 1 when r = 1 and, when r >2, evaluates to 0 in all cases except n = 4, r = 2 
when it correctly evaluates to 1. 

Consider then n > 5, and let r be any value 2 < r < [n/2j. (For r = 1 the expression correctly 
evaluates to 1, and for r > [n/2j the expression correctly evaluates to 0). Every tree on 4 or 
more taxa contains at least one cherry: two taxa x,y that have a common parent u where the 
third neighbour of u is an interior vertex. Fix such a cherry. (A similar technique is used in [13] 
and M)- Observe that any convex character f on P with the property that each state appears 
at least twice, has f{x) = f{y). This follows from the connected-subtree definition of convexity. 
Now, let T' = T|(A1 \ {x}) and let T" = T|(X \ {x, y}), where T\X' denotes the tree (on the set 
of taxa X') obtained from T by taking the minimum subtree connecting the elements of X' and 
then suppressing vertices of degree 2 . 

There are two cases to distinguish. The first case is when the state f{x) = f{y) does not appear 
on any other taxa. There are g 2 {T", r—1) such characters. The second case is when f{x) = f{y) 
does appear on at least one other taxon. There are 52 (T',r) such characters. Hence, 

( 2 ) 52 (r,r)=g 2 (r",r-l)+ 52 (r',r). 


By the inductive hypothesis we have 

.- 7 - X ({n-‘^)-{r-l)-l\ (in 
g2{J,r) = ' 


(r-l)-l 
n — r — 2\ f n — r — 2 


r-2 
n — r — 1 
r — 1 


r—1 


1) - r- A 
r — 1 J 


The last equality follows from the well-known identity known as Pascal’s Rule i.e. ()() = + 

(fcZi); which holds for 1 < fc < n. This completes the proof. □ 


Consequently, the total number of convex characters on a tree P with each state appearing at 
least twice is independent of the topology of P. Specifically, we have: 



Corollary 2. For even n, 32 ( 11 ) = gpnjP). 

Proof. This is immediate by observing that Equation ([3]) can be obtained by substituting n/2 for 
n in Equation (P). □ 
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Let F{n) denote the nth Fibonacci number. That is, F{0) = 0, F{1) = 1 and for n > 2, 
F{n) = F(n — 1) + F(n — 2). For comprehensive background on Fibonacci numbers see [TT| . 

Theorem 3. For n>2, g 2 {n) = F{n — 1) and gi{n) = F{2n — 1). 


Proof. The following identity is classicaQ (n > 0): 



If we index k from 1 rather than 0 we obtain, 



Now, if we replace n with n — 2: 




= 92{n). 

The expression for gi(n) is then obtained by applying Corollary [2J 


□ 


The question arises whether the values g 3 {T,r) and/or 53 (T) share the topological neutrality 
of their gi and 172 counterparts. This is not the case: see Figured Here gaiTi) = 5, because 
53 {Ti,1 ) = 1, 93(71,2) = 3, 53 (Ti,3 ) = 1 and g 3 (Ti,r) = 0 (for r > 3). However, 33 ( 7 ^) = 6, 
because 33 ( 7 ^, 1) = 1, 33(7^, 2) = 4, 33 ( 7 /, 3) = 1 and 33 ( 7 ^, r) = 0 (for r > 3). 


h 



b 


T. 


T, 


2 


Figure 2 . The number of characters that are convex on 7 i with each state 
appearing on at least 3 taxa, is not the same as the corresponding number 
for 72 i.e. 33 ( 7 i) ^ 33(72)- This is because 33(71,2) = 3 (the charac¬ 
ters abc\def ghi, abodef\ghi and abcghi\def) while 33(72,2) = 4 (the characters 
abc\defghi,abcd\efghi,abcde\fgh and abcdef\ghi). Hence topology does play a 
role here, contrasting with the situation for 31 and 32. 


^This is usually attributed to Edouard Lucas (1876). By applying Pascal’s Rule and some algebraic manipulation 
it can be proven by induction. 
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4. Computing gk { T ) (and listing its elements) with dynamic programming 

The results from the previous section give rise to a number of questions. Can we compute gk(T) 
in polynomial time, for fc > 3? Also, if we want to explicitly list all the elements counted by gk { T ) 
[k > 1), is it possible to achieve this in some reasonable total running time e.g. 0((ifc(T)-poly(n))? 
We show that, for all fc > 1, the answer to both questions is yes. Specifically, we show how to 
compute gk{T) using dynamic programming, for each fc > 1. The combinatorial recurrence within 
the dynamic programming will also allow us to derive a polynomial-time computable bijection from 
{1,2,..., 5 fc(T)} to the characters counted by gk { T ). Using this bijection it is then straightforward 
to list (or sample) these elements. Note that this is also an advance for gi(T), since the recurrence 
used in m to derive gi(n) is based on inclusion-exclusion: it is not obvious how to transform it 
into a bijection. 

We begin by rooting T by subdividing an arbitrary edge with a new vertex and (implicitly) 
directing all edges away from this new vertex. The new vertex becomes the root of the tree. Note 
that this rooting operation has no impact on the convexity of characters, and the location of the 
root is irrelevant; it is simply a convenience which ensures that the term “child” is well-defined. 
The dynamic programming works bottom-up, from the leaves towards the root. 

Here it is helpful to represent a character / on AT as a set of non-overlapping, non-empty 
subsets {Hi,..., Bt} that partition X, where each Bi corresponds to a state. We also need some 
new definitions. A character / is valid for gkiT) if 

- / is convex on T, and 

- for each Bi S /, \Bi\ > k. 

Consider an ordered pair (/, B) where / is a character on X and B G f. We call such a pair a 
character-root pair of T if 

- / is convex on T, and 

- there exists a convex extension of / in which the root of T is assigned state B. 

Equality between character-root pairs is defined strictly i.e. (/, B) = (/', B') if and only if / = /' 
and B = B'. We say that a character-root pair {f,B) of T is semi-valid for gkiX) if, for each 
Bi G f, Bi ^ B ^ \Bi\ > k. Note that if a character-root pair (/, H) is semi-valid for gk{T), then 
/ is valid for gk { T ) if and only if \B\ > k. 

At each vertex u of the tree we will compute and store the following k \ values, where Bu is 
simply the subtree rooted at u\ 

~ gkiBu), 

- for each 1 < m < fc — 1 the number h{Bu, rn) which is defined as the number of character- 
root state pairs {f,B) of Tu such that both the following conditions hold: {f,B) is semi- 
valid for gk{Tu) and \B\ = m. We also store h{Tm k) which is defined slightly differently: 
we replace the term |H| = m with \B\ > k. 

If It is a taxon, then: 

- gk{Tu) is equal to 1 if fc = 1, and 0 if fc > 1, 

- h{Tu, rn) is equal to 1 if m = 1 and 0 if m > 1. 

We show how to compute these values recursively, assuming the corresponding values have 
already been computed for 71 (the subtree rooted at the left child of u) and Tr (the subtree rooted 
at the right child of u). First, 

(4) gk{Tu) = gk{Ti)gk{Tr) + ^ h{Ti,i)h{Tr,j). 

l<i,j<k 

The idea behind this recurrence is that characters counted by gk{Bu) can be created in two ways: (i) 
by taking the union of a character from the left subtree with a character from the right subtree, and 
(ii) by taking a character-root pair (/, B) from the left subtree, a character-root pair (/', B') from 
the right subtree, and then merging the root states to yield a character (/\H)U(/'\H')U(HUH'). 
Characters from the subtrees can only be used for (i) if they are already valid (with respect to gk) 
in their subtrees. The characters / and /' that are used for (ii) might not be valid with respect 
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to their subtrees, but we do require that they can be combined to obtain a character that is valid 
for gk{Tu)- This is possible if and only if {f,B) and are semi-valid for their respective 

subtrees and the sum of the cardinalities of B and B' is at least k. See Figure |3] for an example. 



Figure 3. The character abc\d is not valid for gs{Ti), and efg\hi is not valid for 
gaijr), but the character abc\dhi\efg is valid for g 3 {Tu)- The validity is obtained 
by allowing d and hi to merge, which was possible because they could both “reach” 
the roots of their respective subtrees. 

Second, for 1 < m < fc — 1 we have, 

(5) h{Tu,m) = gk{Ti)h{Tr,rn) + h{Ti,rn)gk(J'r) + ^ h{Ti,i)h{Tr, j). 

l < i , j < k -2 

Note that here m < k — 1, which means that the semi-valid character-root pairs {f,B) counted 
by this recurrence are such that / is not valid for gk{Tu)- The first two terms of the recurrence 
concern the situation analogous to (i) above. Specifically, in this case we assume that no states are 
merged, so a new semi-valid character-root pair can be created for 7^ if and only if it is constructed 
from the combination of a valid character from one subtree, with a semi-valid character-root pair 
from the other. The summation term corresponds to (ii). That is, we only count combinations of 
character-root pairs from the two subtrees if the cardinality of the merged state is exactly m. 
Finally we have, 

(6) h{J-u,k)=gk{Tl)h{J'r.k)+h{ri,k)gk{Tr)+ ^ h{Tl,l)h{rr,3). 

2+J>/c 

This final recurrence is semantically very similar to the previous one. The main difference is 
that it counts all semi-valid character-root pairs (/, B) for gk{Tu) such that / is valid for gkiju)- 

For a given vertex u, Equation 0 can be computed in 0(kF‘) time, assuming the values for 
71 and %■ have already been computed earlier. The same time bound holds for Equations ([5]) 
(for a specific 1 < m < fc — 1) and ([6|). Equation ([5]) has to be computed for each m, yielding a 
naive running time bound of O(fc^) (per vertex u), but this can easily be improved to 0(k‘^) by 
observing that a single 1 < i, j < k sweep over the h{Ti,i) and h[Tr,j) values can be recycled 
for computation of all the different h{Tu,m) values. There are 2(n — 1) vertices in the tree. This 
yields the following theorem. 

Theorem 4. Let T be an unrooted binary tree on n taxa. For each 1 < fc < n, gkiT) can be 
eomputed in 0{k‘^ ■ n) time and 0(k ■ n) space. 

Corollary 5. Let T be an unrooted binary tree on n taxa. For each 1 < fc < n, all the characters 
that are counted by gkiT) can be generated in OigkiT) ■kF' -n) total time, and a character counted 
by gkiT) can be sampled uniformly at random in 0(fc^ ■ n) time and Oik ■ n) space. 

Proof. Critically, there is no inclusion-exclusion involved Equations (|4]), ([5|) and dH). This allows 
us to impose a canonical ordering on the characters (and character-root pairs) counted by these 
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equations. For example, within Equation (|3]) we can choose to place the type-(i) characters earlier 
in the ordering than the type-(ii) characters. Within the gk{Ti)gk(Tr) type-(i) characters we can 
refine the order as follows: the first character from the left subtree combined with in turn each 
of the gkifTr) characters from the right subtree, then the second character from the left subtree 
combined in turn with each of the gkOV) characters from the right subtree, and so on. Once 
a canonical ordering has been chosen and the dynamic programming has been completed, we 
can start at the root of T and (using the g and h values computed at all vertices of the tree) 
recursively backtrack to generate the uniqely defined ith character. Hence, we obtain a bijection 
from {1, 2,... , 5 fe(T)} to the characters counted by 5 fc(T). The time and space requirements for 
backtracking through the tree (i.e. evaluating the bijection for a given element of {1, 2,..., ( 7 fc(T)}) 
are dominated by the time and space requirements of executing the original dynamic program, 
which are 0{k^ ■ n) and 0{k ■ n) respectively. This bijection can then be used to list all the 
characters counted by gkiT) or to sample uniformly at random from this space. □ 


We have implemented the dynamic programming (and the corresponding algorithms for listing 
and sampling) in Java and this can be downloaded from http: //skelk. sdf-eu. org/convexcount. 

Finally within the section, for an unrooted binary tree T on n leaves, we define the g-spectrum 
as simply the vector ( 5 i(T), 52 (T),... ,gn{T)). It is natural to ask whether two trees on n taxa 
have the same g-spectrum if and only if they are isomorphic (see e.g. [3] for related discussions of 
“identifiability”). Using the code above we have verified that, while this claim is true for n < 10 
leaves (see the software website for a proof), a counter-example exists for n = 11, see Figure IH 




T 


2 


Figure 4 . These two trees on 11 leaves are non-isomorphic but have the same 
g-spectrum: (10946,55, 8, 3, 2,1,1,1,1,1,1). 


5 . Algorithmic applications 

One of the advantages of expressing gi(n) and g 2 (n) as Fibonacci numbers is that it allows 
us to give tight bounds on their rate of growth. This can be particularly useful when bounding 
the running time of algorithms. Consider the following classical, closed-form expression for the 
Fibonacci numbers (n > 0), where (j) = ~ 1.618... is the golden ratio: 


F{n) 


( 5 ” 1 

75^2 
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(It is obtained from Binet’s Formula F{n) = -—— (1843) by observing that the term {—(j)) " 
is vanishing.) Combining with Theorem[3]we obtain 


9i{n) 

52 (n) 


1 

^ ^ 2 _ ■ 


Using asymptotic notation, it is clear that there are convex characters and con¬ 

vex characters in which each state occurs on at least two taxa. We give two examples of how 
these insights yield non-trivial exponential-time algorithms for two NP-hard problems arising in 
phylogenetics. 

5.1. Computation of m^Lximum parsimony distance. Let 7i,72 be two unrooted binary 
trees on the same set of taxa X. The metric c^mp (7i, 72 ) (the maximum parsimony distance of 
71 and 71) is defined as follows, where / ranges over all characters on X and lf{.) is as defined in 
Section 

dMp(Ti,71) = max|l/(Ti) - ^/(71)| 

It is NP-hard to compute dMP [SlIS]. It can be used to quantify the dissimilarity of two phylogenetic 
trees and is a lower bound on the similarly NP-hard tree bisection and reconnection (TBR) distance, 
denoted dxBR [I]- 

Theorem 6. Given two unrooted binary trees 71,72 on the same set of taxa X, where |7f| = n, 
dMp(71,72) can he computed in time ■ n^), where (p ~ 1.618... is the golden ratio. 

Proof. In [a IS] it is proven that the optimum is achieved by some character / that is convex on 
71 or 71 and where each state in the character occurs on at least two taxa. Hence simply looping 
through all the characters counted by 32(71) and, separately, all the characters counted by 32(71) 
is sufficient to locate an optimal character. Note that l/(.) can be computed in 0{n?) time using 
Fitch’s algorithir0 [7] or dynamic programming. Hence, scoring each character / can easily be 
performed in quadratic time. The result then follows by leveraging Corollary [5] □ 

We have implemented the dup algorithm in Java and for an exponential-time algorithm the 
results are encouraging; the code is freely available at http: //skelk. sdf-eu. org/convexmpdist. 
On a single 32-bit 1.66GHz Intel Atom (N450) processor the algorithm terminates for n = 20, 25,30 
in less than I second, 3 seconds and 51 seconds respectively. On a more powerful 64-bit 3.I0GHz 
machine the previously fastest algorithm, the Integer Linear Programming (ILP) approach de¬ 
scribed in [9], took 70 seconds to terminate on 12 taxa, and stalled completely on trees with more 
than 16 taxa, even using state-of-the-art ILP software. The enhanced range of our software has 
been recently used in experiments to verify that dMP is often a very good lower bound on c^mp 

m- 

5 . 2 . Computation of TBR distance and maximum agreement forests. Finally, we note 
that the results in this article also give an easy (although, in some cases, somewhat crude) upper 
bound on the number of agreement forests of two unrooted binary trees 7i, 72 on n taxa. Recall 
that, for an unrooted binary phylogenetic tree T on X and X' C X, T\X' is defined to be 
the unrooted binary phylogenetic tree on X' obtained by taking the minimal subtree of T that 
connects A', and suppressing vertices of degree 2. An agreement forest is a partition of X into 
non-empty subsets Ai,..., A^ such that (i) within 7i (respectively, T 2 ) the minimal connecting 
subtrees induced by the Xi are vertex-disjoint and (ii) for each A^, Ti\Xi = T 2 \Xi (where here 
equality explicitly takes the taxa into account). See [I] and recent articles such as [4] for further 
background on agreement forests. A maximum agremeent forest is an agreement forest with a 

^Fitch’s algorithm has running time 0{ns) where n is the number of taxa and s is the number of states in the 
character. In our context s can rise to 0{n). 
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minimum number of components, and this minimum is denoted (Imaf- Note that due to part 
(i) of the definition every agreement forest induces a character that is convex on both 71 and T 2 
(althoMh not all characters that are convex on both 7i and T 2 necessarily correspond to agreement 
foresttlj)- Hence there are at most agreement forests, which is 0(2.619"). It is well-known 

that cItbr is equal to the number of components in a maximum agreement forest, minus 1 [1]. 
Hence, again by leveraging Corollary [5] we obtain: 

Theorem 7. Given two unrooted binary trees 71,72 on the same set of taxa X, where |7f| = n, 
dTBR(7i,72) = dMAF^Ti^Tz) — 1 Can be computed in time 0(2.619" • poly(n)). Moreover, all 
agreement forests of 71 and 71 can be listed in the same time bound. 

6 . Conclusion 

A number of interesting open problems remain. For example, can we characterize non-isomorphic 
trees that have the same ^-spectrum? For a given fc > 3 and n, can we give analytical lower and 
upper bounds on gk(T), ranging over the space of all trees T on n taxa? 
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^However, if 7i = 72 then convex characters and agreement forests are related one-to-one. 



